Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 21

1	Working with a small dataset - semi-supervised dependency parsing for Irish
	van Genabith, Josef; Foster, Jennifer; Lynn, Teresa...
	In: Lynn, Teresa, Foster, Jennifer orcid:0000-0002-7789-4853 , Dras, Mark orcid:0000-0001-9908-7182 and van Genabith, Josef orcid:0000-0003-1322-7944 (2013) Working with a small dataset - semi-supervised dependency parsing for Irish. In: Fourth Workshop on Statistical Parsing of Morphologically Rich Languages, 18 Oct 2013, Seattle, WA. USA. (2013)
	BASE
	Show details

2	Working with a small dataset - semi-supervised dependency parsing for Irish
	Lynn, Teresa; Foster, Jennifer; Dras, Mark. - : Stroudsburg, PA : Association for Computational Linguistics, 2013
	BASE
	Show details

3	Detecting grammatical errors with treebank-induced, probabilistic parsers
	Wagner, Joachim. - : Dublin City University. School of Computing, 2012
	In: Wagner, Joachim orcid:0000-0002-8290-3849 (2012) Detecting grammatical errors with treebank-induced, probabilistic parsers. PhD thesis, Dublin City University. (2012)
	Abstract: Today's grammar checkers often use hand-crafted rule systems that define acceptable language. The development of such rule systems is labour-intensive and has to be repeated for each language. At the same time, grammars automatically induced from syntactically annotated corpora (treebanks) are successfully employed in other applications, for example text understanding and machine translation. At first glance, treebank-induced grammars seem to be unsuitable for grammar checking as they massively over-generate and fail to reject ungrammatical input due to their high robustness. We present three new methods for judging the grammaticality of a sentence with probabilistic, treebank-induced grammars, demonstrating that such grammars can be successfully applied to automatically judge the grammaticality of an input string. Our best-performing method exploits the differences between parse results for grammars trained on grammatical and ungrammatical treebanks. The second approach builds an estimator of the probability of the most likely parse using grammatical training data that has previously been parsed and annotated with parse probabilities. If the estimated probability of an input sentence (whose grammaticality is to be judged by the system) is higher by a certain amount than the actual parse probability, the sentence is flagged as ungrammatical. The third approach extracts discriminative parse tree fragments in the form of CFG rules from parsed grammatical and ungrammatical corpora and trains a binary classifier to distinguish grammatical from ungrammatical sentences. The three approaches are evaluated on a large test set of grammatical and ungrammatical sentences. The ungrammatical test set is generated automatically by inserting common grammatical errors into the British National Corpus. The results are compared to two traditional approaches, one that uses a hand-crafted, discriminative grammar, the XLE ParGram English LFG, and one based on part-of-speech n-grams. In addition, the baseline methods and the new methods are combined in a machine learning-based framework, yielding further improvements.
	Keyword: Artificial intelligence; Computational linguistics; decision tree learning; error corpora; error detection; grammar checker; Language; learner corpus; Linguistics; Machine learning; n-gram language models; natural language processing; precision grammar; probabilistic grammar; ROC curve; voting classifier
	URL: http://doras.dcu.ie/16776/
	BASE
	Hide details

4	Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification
	van Genabith, Josef; He, Yifan; Tu, Zhaopeng...
	In: Tu, Zhaopeng, He, Yifan, Foster, Jennifer orcid:0000-0002-7789-4853 , van Genabith, Josef orcid:0000-0003-1322-7944 , Liu, Qun and Shouxun, Lin (2012) Identifying high-impact sub-structures for convolution kernels in document-level sentiment classification. In: Annual Meeting of the Association for Computational Linguistics (ACL 2012), 9-11 Jul 2012, Jelu, Korea. (2012)
	BASE
	Show details

5	Irish treebanking and parsing: a preliminary evaluation
	van Genabith, Josef; Cetinoglu, Ozlem; Uí Dhonnchadha, Elaine...
	In: Lynn, Teresa, Cetinoglu, Ozlem, Foster, Jennifer orcid:0000-0002-7789-4853 , Uí Dhonnchadha, Elaine orcid:0000-0003-3448-4288 , Dras, Mark orcid:0000-0001-9908-7182 and van Genabith, Josef orcid:0000-0003-1322-7944 (2012) Irish treebanking and parsing: a preliminary evaluation. In: International Conference on Linguistic Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey. (2012)
	BASE
	Show details

6	Irish treebanking and parsing : a preliminary evaluation
	Lynn, Teresa; Çetinoğlu, Özlem; Foster, Jennifer. - : European Language Resources Association (ELRA), 2012
	BASE
	Show details

7	Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities
	Hogan, Deirdre; Foster, Jennifer; van Genabith, Josef
	In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities. In: Multiword Expressions: from Parsing and Generation to the Real World (MWE). Workshop at ACL 2011, 19-24 June 2011, Portland, Oregon. (2011)
	BASE
	Show details

8	Comparing the use of edited and unedited text in parser self-training
	Wagner, Joachim; Cetinoglu, Ozlem; Foster, Jennifer...
	In: Foster, Jennifer orcid:0000-0002-7789-4853 , Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Comparing the use of edited and unedited text in parser self-training. In: The 12th International Conference on Parsing Technologies (IWPT 2011), 05-07 Oct 2011, Dublin, Ireland. ISBN 978-1-932432-04-6 (2011)
	BASE
	Show details

9	From news to comment: Resources and benchmarks for parsing the language of web 2.0
	Wagner, Joachim; Cetinoglu, Ozlem; Le Roux, Joseph...
	In: Foster, Jennifer orcid:0000-0002-7789-4853 , Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849 , Le Roux, Joseph, Nivre, Joakim, Hogan, Deirdre and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) From news to comment: Resources and benchmarks for parsing the language of web 2.0. In: The 5th International Joint Conference on Natural Language Processing (IJCNLP), 08-13 Nov 2011, Chiang Mai, Thailand. ISBN 978-974-466-564-5 (2011)
	BASE
	Show details

10	#hardtoparse: POS tagging and parsing the twitterverse
	van Genabith, Josef; Hogan, Deirdre; Foster, Jennifer...
	In: Foster, Jennifer orcid:0000-0002-7789-4853 , Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849 , Le Roux, Joseph, Hogan, Stephen, Nivre, Joakim, Hogan, Deirdre and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) #hardtoparse: POS tagging and parsing the twitterverse. In: The AAAI-11 Workshop on Analyzing Microtext, 8 Aug 2011, San Francisco, CA. (2011)
	BASE
	Show details

11	Improving dependency label accuracy using statistical post-editing: A cross-framework study
	Cetinoglu, Ozlem; Bryl, Anton; Foster, Jennifer...
	In: Cetinoglu, Ozlem, Bryl, Anton, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Improving dependency label accuracy using statistical post-editing: A cross-framework study. In: International Conference on Dependency Linguistics (DepLing), 5-7 Sept 2011, Barcelona, Spain. (2011)
	BASE
	Show details

12	LFG without C-structures
	Cetinoglu, Ozlem; Foster, Jennifer; Nivre, Joakim...
	In: Cetinoglu, Ozlem, Foster, Jennifer orcid:0000-0002-7789-4853 , Nivre, Joakim, Hogan, Deirdre, Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) LFG without C-structures. In: the 9th International Workshop on Treebanks and Linguistic Theories, 3 - 4 Dec. 2010, Tartu Estonia. (2010)
	BASE
	Show details

13	Handling unknown words in statistical latent-variable parsing models for Arabic, English and French
	Attia, Mohammed; Tounsi, Lamia; van Genabith, Josef...
	In: Attia, Mohammed, Foster, Jennifer orcid:0000-0002-7789-4853 , Hogan, Deirdre, Le Roux, Joseph, Tounsi, Lamia and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Handling unknown words in statistical latent-variable parsing models for Arabic, English and French. In: SPMRL 2010 - 1st Workshop on Statistical Parsing of Morphologically-Rich Languages at NAACL HLT 2010, 5 June 2010, Los Angeles, CA, USA. (2010)
	BASE
	Show details

14	Handling Unknown Words in Statistical Latent-Variable Parsing Models for Arabic, English and French
	Attia, Mohammed; Foster, Jennifer; Hogan, Deirdre...
	In: Proceedings of the First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010) ; First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010) ; https://hal.archives-ouvertes.fr/hal-00702414 ; First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), 2010, United States. pp.67-75 (2010)
	BASE
	Show details

15	Judging grammaticality: experiments in sentence classification
	Wagner, Joachim; Foster, Jennifer; van Genabith, Josef
	In: Wagner, Joachim orcid:0000-0002-8290-3849 , Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2009) Judging grammaticality: experiments in sentence classification. CALICO Journal, 26 (3). pp. 474-490. ISSN 0742-7778 (2009)
	BASE
	Show details

16	Adapting a WSJ-trained parser to grammatically noisy text
	Foster, Jennifer; Wagner, Joachim; van Genabith, Josef
	In: Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Adapting a WSJ-trained parser to grammatically noisy text. In: ACL-08:HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 15-20 June 2008, Columbus, USA. (2008)
	BASE
	Show details

17	Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics
	Foster, Jennifer; van Genabith, Josef
	In: Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2008) Parser evaluation and the BNC: evaluating 4 constituency parsers with 3 metrics. In: LREC 2008 - Sixth International Conference on Language Resources and Evaluation, 28-30 May 2008, Marrakech, Morocco. (2008)
	BASE
	Show details

18	Parser-based retraining for domain adaptation of probabilistic generators
	Hogan, Deirdre; Foster, Jennifer; Wagner, Joachim...
	In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Parser-based retraining for domain adaptation of probabilistic generators. In: INLG 08 - 5th International Natural Language Generation Conference, 12-14 June 2008, Salt Fork, Ohio, USA. (2008)
	BASE
	Show details

19	C-structures and f-structures for the British national corpus
	Wagner, Joachim; Seddah, Djamé; Foster, Jennifer...
	In: Wagner, Joachim orcid:0000-0002-8290-3849 , Seddah, Djamé, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef (2007) C-structures and f-structures for the British national corpus. In: Lexical Functional Grammar 2007, 28-30 July 2007, California, USA. (2007)
	BASE
	Show details

20	Adapting WSJ-trained parsers to the British national corpus using in-domain self-training
	Foster, Jennifer; Wagner, Joachim; Seddah, Djamé...
	In: Foster, Jennifer orcid:0000-0002-7789-4853 , Wagner, Joachim orcid:0000-0002-8290-3849 , Seddah, Djamé and van Genabith, Josef (2007) Adapting WSJ-trained parsers to the British national corpus using in-domain self-training. In: IWPT 2007 - 10th International Conference of Parsing Technology, 23-24 June 2007, Prague, Czech Republic. (2007)
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern